Index Theorems for Polynomial Pencils 



Richard Kollar, Radormr Bosak 

Department of Applied Mathematics and Statistics 
Faculty of Mathematics, Physics, Informatics, Comenius University 
Mlynska dolina, 842 48 Bratislava, Slovakia 
E-mail: kollar@fmph.uniba.sk 

December 27, 2012 

Abstract 

We survey index theorems counting eigenvalues of linearized Hamil- 
tonian systems and characteristic values of polynomial operator pencils. 
We present a simple common graphical interpretationand generalization of 
the index theory using the concept of graphical Krein signature. Further- 
more, we prove that derivatives of an eigenvector u — u(A) of an operator 
pencil £(A) satisfying £(A)u(A) = /i(A)m(A) evaluated at a characteristic 
value of £(A) do not only generate an arbitrary chain of root vectors of 
jC(X) but the chain that carries an extra information. 

1 Introduction 

Spectral problems naturally arise in investigations of stability and decay rates 
of nonlinear waves and stability of numerical schemes, in solving intcgrable 
systems via inverse scattering method, and in multiple other fields. The main 
interest is in presence of point spectrum with a positive real part (unstable 
eigenvalues) that corresponds to destabilizing modes. Over the last 40 years 
counts of unstable eigenvalues and other related counts that we refer to as 
index theorems appeared across various distinct and unrelated fields due to their 
simple structure and importance for applications. Here we briefly survey index 
theory literature, point out its common graphical interpretation, and derive its 
generalization to problems with arbitrary structure of the kernel of the operators 
involved. 

Linearized Hamiltonian systems. Index theorems proved to be particularly 
useful in spectral stability theory of waves in Hamiltonian systems where one 
studies spectrum a(JL) of the non-selfadjoint problem 

JLu^vu, J=-J*, L = L* . (1) 

Here J and L are operators acting on a Hilbert space X, Lis the second variation 
of the underlying Hamiltonian, u e X, and L* denotes the adjoint operator of L 



[7TJE2]- Problem pj appears in search for exponentially growing or decaying so- 
lutions v(x, t) — e u(x) of the linearized Hamiltonian system vt — JLv around 
the equilibrium vo = vo(x, t) of the (nonlinear) Hamiltonian system with v(x, t) 
representing an infinitesimal perturbation of v . The wave Vq is said to be spec- 
trally stable if has no solution with Re v > for v e X . Due to the natural 
symmetry of spectrum a(JL) (see 02]) the spectral stability is equivalent to the 
confinement of a(JL) to the imaginary axis. While positivity of the spectrum 
of L implies spectral stability, the operator L often has negative eigenvalues, 
and due to the symmetries of the system also a non-trivial kernel opening up an 
opportunity for instability in the system. However, the symmetries through the 
Nocther theorem imply existence of conserved quantities (typically correspond- 
ing to physically meaningful quantities as mass, momentum, etc.) and their 
conservation restricts possible degrees of freedom in the system and thus can 
prohibit instability in cases of indefinite L. The index theorems for linearized 
Hamiltonians relate the spectra of L and JL, and particularly the number of 
negative real eigenvalues of L and unstable eigenvalues of JL. Analogous index 
theory was developed for quadratic operator pencils. The link between these 
two types of results is that for invertible J it is possible to reformulate ^ as a 
linear operator pencil. Therefore, both types of index theorems can be viewed 
as special cases of a general theory for operator pencils. 

Two different ways to interpret the index theorems mathematically can be 
found in the literature. Motivated by the work of Hestenes [25] (see also [20] ) 
Maddocks EH ES] derived the dimension counts for restricted quadratic forms 
in finite dimensional spaces and showed how the question of stability of an 
equilibrium of a Hamiltonian system reduces to a question whether a quadratic 
form is positive when restricted to a particular subspace of its domain. Such an 
approach was later used in works [2J] EE1 [32J [151 [27] |3] and it is closely related to 
the theory of indefinite inner product spaces [3 [2pJ EI] . ft provides a geometric 
visualization of the index theorems as counts of dimensions of an intersection 
of a negative energy cone associated with the indefinite quadratic form with 
the subspace spanned by normal vectors (under the indefinite inner product) to 
hyperplanes tangential to surfaces of conserved quantities (Fig. [I] left panel) . 

However, here we focus on an alternative point of view of a different ge- 
ometrical (graphical) nature El HIJ [52]. We interpret the index theorems as 
topological counts of curves of eigenvalues of operator pencils in a plane (Fig. [T] 
right panel). We believe that such an interpretation provides not only a simpler 
visualization of the theory but also an easier way to generalize it. More surpris- 
ingly, as we will show, it also yields reduced algebraic formulae for calculation of 
the indices of operators with complicated generalized kernels because the chain 
of root vectors generated by the graphical method carry an extra information 
compared to a (generic) chain of root vectors. 
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Figure 1: Visualization of the index theory. Left panel: Algebraic approach. 
The equilibrium is (spectrally) stable if and only if the normal vector (in the 
associated indefinite inner product space) to the plane Cu = corresponding 
to the invariant (conserved quantity) of the system lies in the negative energy 
cone C = {ue X, (u, Lu) < 0} [58]. 

Right panel: Graphical approach. Plot of eigenvalue branches fi = /i(A) (the 
point spectrum u p {L — XK)) of the eigenvalue pencil (L — \K)u — //it. Purely 
imaginary eigenvalues v of JL correspond to intercepts of with the axis 
fi = via A = if (indicated by full circles). Their Krein signature is given by 
the sign of //(A) [42]. 

2 Krein Signature 

Index theorems often refer to Krein signature of eigenvalues (characteristic val- 
ues), a quantity that characterizes ability of the (stable) eigenvalue to become 
unstable under a perturbation [48j . In linearized Hamiltonian systems the Krein 
signature k_l(A) of an eigenvalue v of JL captures the signature of the quadratic 
form (•, L •) representing the linearized energy on the invariant subspace spanned 
by eigenspaces corresponding to (v, —V) (see MacKay [57] for the geometric visu- 
alization). Krein signature is also referred to as sign characteristics 19J within 
the context of operator pencils and symplectic signature in Hamiltonian me- 
chanics (see Kirillov [38 for a detailed discussion of the terminology, literature 
survey, and extension of the results of [57] ) . If the signature of the energy on 
the subspace is indefinite, the Krein signature is said to be indefinite, otherwise 
it is definite (positive or negative). It is easy to see that the Krein signature 
of non-purely imaginary and non-semi-simple purely imaginary eigenvalues is 
indefinite 68, 29]. On the other hand, the signature of any simple non-zero 
purely imaginary eigenvalue of ([lj is definite [25]. 

If J is invertible it is useful to define K — (?J) _1 and to note that K is 
sclf-adjoint. Then Lu — ivKu for a simple purely imaginary eigenvalue v of JL 
with the eigenvector u and 

'■— sign(Lw, u) — signii/(u, Ku) . 
Thus the sign of (u,Ku) agrees with up to the sign of iu and one can 
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define 

kk (A) := —(it, Ku) , for A := iv, Lu — XKu. (2) 

Since the definition ^ is closely related to its graphical analogue, we will drop 
the index K in ^ in the rest of the paper with exceptions at places where it 
is necessary to avoid a confusion. Due to the rotation A = iv the main interest 
lies in Krein signature of A £ M, so we limit ourselves to a definition of Krein 
signature of real eigenvalues of ([!]). An analogous definition can be given for 
the Krein indices and signature of a real characteristic value of a polynomial 
self-adjoint operator pencil E2] . 

Definition 1. Let J be an invertible skew-adjoint and L be a self-adjoint oper- 
ator on the Hilbert space X and let Xq be a real eigenvalue of iJL with hi one of 
its corresponding chains of generalized eigenvectors. Let K = (iJ)^ 1 and let W 
be the (Hermitian) Gramm matrix of the quadratic form (■, — K ■) on the span 
of hi. The number of positive (negative) eigenvalues of W is called the positive 
(negative) Krein index of hi at Ao and is denoted K + (hi,Xo) (k~(U,Xq)). The 
sums of n^ipl, Ao) over all (maximal) chains of generalized eigenvectors hi of Xq 
are called the positive and negative Krein indices of Xq and are denoted ^(Xq). 
Finally, k(U,Xo) := k + (U,Xq) — k^{U,Xq) is called the Krein signature of the 
chain U for Ao, and k(Xq) := k + (Ao) — k~(Xq) is called the Krein signature of 
A . 

Consider the spectrum of the operator pencil C(X) := L — XK, i.e., the set 
of fi = ju(A) for which there exists u £ X such that 

C(X)u = [iu . (3) 

The real point spectrum of iJL corresponds one-to-one (including multiplicity) 
to the set of real characteristic values of £(A), i.e., to Ao £ R such that £(Ao) 
has a non-trivial kernel [19l [61] [42] . Under suitable assumptions the eigenvalues 
jtt(A) and eigenvectors u(A) can be chosen to be real analytic in A [12] and it is 
possible to define the graphical Krein signature for a self-adjoint operator pencil 

03111]. 

Definition 2. Let C(X) be a self-adjoint operator pencil and assume that C 
has an isolated real characteristic value Xq and there are real analytic eigen- 
value branches /i(X) of C{X) such that eigenvalues of C(X) for X close to Xq are 
identical to fi(X). Let /i = //(A) be one of the branches with fi^ n \Xo) = for 
n = 0,l,...,m-l, and M (m) (A ) ^ 0. Let T](fi) := sign(^(" l )(A )) = ±1. Then 
the quantities 

Kg O,A ) := | l™' 

are called the positive and negative graphical Krein indices of the eigenvalue 
branch /! = /x(A) corresponding to the characteristic value Xq. The sums of 
Kg(/i, Aq) over all eigenvalue branches crossing at (A, fi) = (Aq,0) are called 



for m even, , . 

(4) 

for m odd, 
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the positive and negative graphical Krein indices of Ao and are denoted K^(Ao). 
Finally, ft g (/x, Ao) := Kg(p, Ao) — Kg (/x, Ao) is called the graphical Krein signature 
of the eigenvalue branch jj, = /x(A) vanishing at Ao, and K g (Ao) := k+(Ao) — 
K g (Ao) *s called the graphical Krein signature of Xq. 

Definition[2]extends to general self-adjoint operator pencils as long as smooth 
eigenvalue and eigenvector branches exist in a neighborhood of an isolated char- 
acteristic value Ao |42| . The fundamental relation between the Krein signature 
and the graphical Krein signature for real A [3H [5TJ [5] is given by 

« g (A) = K Jf (A). (5) 

The Krein signature k(X) then can be read off the graph of spectrum of £(A) 
and the chain of generalized eigenvectors of iJL corresponding to Ao can be 
generated by derivatives of the eigenfunction branch u(A) corresponding to the 
eigenvalue n(X) of {§ (Sfl E2 E] • 

3 Index Theorems for Linear Pencils and Lin- 
earized Hamiltonians 

We introduce the following notation. Let X and Y be separable Hilbert spaces 
and let A be a densely defined operator D(A) C X — >• Y. We denote cr p (A) 
the point spectrum of A and n uas (A) the unstable index of A counting the 
number of unstable eigenvalues of A, i.e. the number of points in o~ p (A) n 
{Re(z) > 0}. Furthermore, let p(A), z(A), and n(A) be, respectively, the 
counts of, repsectively, positive, zero, and negative real eigenvalues of a self- 
adjoint operator A counting (geometric) multiplicity, allowing the counts to be 
infinite. 

In 1972 Vakhitov and Kolokolov [73] studied spectral stability of stationary 
(in an appropriate reference frame) solutions (j)^ of a nonlinear Schrodinger 
equation parameterized by angular velocity uj. Their linearized stability is 
characterized by the spectrum of the eigenvalue problem 0. The Vakhitov- 
Kolokolov criterion states that if L± are self-adjoint operators, L + non-negative, 
a{L_)nR- ={-X Q }, 




then 

n un s{JL) = n(L) — n(dl/duj) . (7) 

Here n(L) — 1 and I(uj) — J (jx^dx is the energy of the stationary solution, 
i.e., dl/du is a scalar. The quantity dl/dto can be related to the sign of the 
derivative D'(X) at A = of the Evans function [SS] and to the quadratic form 
(•, L-) evaluated at the generalized kernel of JL corresponding to the eigenvector 
(pu . Thus the count of unstable eigenvalues of JL depends on the number related 
to gKer( JL). 
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Under the assumption of the full Hamiltonian symmetry {A, —A, A, —A} of 
(Tp(JL) it is straightforward to generalize ^ to the parity index theorem: 
n uns (JL) — (n(L) — n(D)) is an even and non-positive integer, where n(D) is 
the count of negative eigenvalues of a matrix D related to gKer( JL) (see The- 
orem [l]) . This parity index theorem was proved in 1987 in celebrated papers 

[5S] by Grillakis, Shatah, and Strauss who also proved the connection of the 
spectral stability to nonlinear stability for a wide class of problems (see also 
[70l Chapter 4] and [67] for the survey of the stability results in the context 
of nonlinear Schrodinger and KdV equations). The generalization of the par- 
ity index theorem for the operators related to stability of dispersive waves was 
proved by Lin [55] . The parity index theorem plays an important role in the 
theory of gyroscopic stabilization where it states that if the degree of instabil- 
ity (negative index) of the system is odd the the equilibrium point cannot be 
stabilized by gyroscopic force. This fundamental theorem is often referred to as 
Thomson theorem [71 DUST], Thomson- Tait-Chetaev theorem [63, Chapter 6] 
and Kelvin's theorem [35] (Lord Kelvin's original name was William Thomson). 
See the works of Kozlov [JU , Kozlov and Karapetyan [17] , Chern [T3J for 
further results, references, and discussion of its implications, and Kozlov |45) 
for the the topological implications of the theorem. 

It is easy to identify n uns (JL) — k r + 2k c , where k r is the number of 
positive real eigenvalues of JL and k c is the number of eigenvalues of JL in 
{z e C,Re(z) > 0,lm(z) > 0}. Furthermore, denote 

K = K ZW= k kW- 

isecr p (JL)niR,iu<0 \Ea p (iJL)nR~ 

In the case of the operator JL with all purely imaginary eigenvalues simple 
the number counts the purely imaginary eigenvalues of JL with positive 
imaginary part of negative Krein signature. The final form of the index theo- 
rem for linearized Hamiltonians was proved in 2004 independently by Kapitula, 
Kevrekidis and Sandstede [32] and Pelinovsky [66] (some assumptions of [66] 
were removed in [75]). 

Theorem 1 ( |32| 66]). Let J be an invertible skew- adjoint and L a self-adjoint 
operator acting on a Hilbert space X , with J^ 1 bounded on a subspace of X 
of a finite codimension, n(L) < oo, and o~(L) l~l {x € K, x < 0} C a p (L). 
Assume that the operators J and L satisfy (symmetry) assumptions that imply 
the full Hamiltonian symmetry of o~ p (JL) and the fact that all Jordan chains 
corresponding to kernel of JL have length two. Let V = gKcr(JL) Q Ker(L) and 
let D be the (symmetric) matrix of the quadratic form (•,£•) restricted to V. 
Then 

n uns (JL) = k r + 2k c = n(L) - n(D) - 2kJ . (8) 



Note that the count ([8| under the assumption z(L) — was already estab- 
lished in 1988 by Binding and Browne [5] Proposition 5.5] in the context of 
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Langer [54], and also in Bognar 



Sturm-Liouville theory. They considered the case of L semi-bounded with com- 
pact resolvent and J 1-to-l and used the standard perturbation theory combined 
with the graphical Krein signature referred to as two parameter spectral theory. 
Their beautiful short and simple argument is based on graphical inspection of 
eigenvalues branches (eigencurves) that can be interpreted as a homotopy in the 
parameter /zo from L + fiyl positive definite to L indefinite and counting of the 
eigenvalue branches intersections with the axis fi = (see also [5] ) . Furthermore, 
the claim and the proof of the Theorem I] is in some extent implicitly present 
in the theory developed by Iohvidov [29 
Section XL 4]. 

For the operators arising in the spectral theory of Sturm-Liouville problems 
with indefinite weight an index theorem ([8| plays an important role. While the 
upper bound of n uns (JL) is well understood [79l Theorem 5.8.2], the exact count 
and dependence of its individual factors on the coefficient of the underlying 
differential equation poses an important open problem [791 Problems IX-X, 
p. 300, Problem 1, p. 124, Comment (7), p. 128]. 

Kollar and Miller [32] proved the index theorem for selfadjoint matrix pencils 
that we further generalize in Section [I] and that implies a generalization of 
a finite-dimensional version of Theorem [I] For periodic Hamiltonian systems 
Kapitula and Haragus, [27] proved the analogue of ([8| using the Floquet theory 
(Bloch wave decomposition) under technical assumptions related to the Keldysh 
theorem that guarantees completeness of the eigenvectors for ([I]) . Some of the 
technical assumptions of [37] were later removed in [T7J. See also [TU] for an 
alternative proof of Q based on the integrable structure of the underlying 
problem. Recently, Stefanov and Kapitula |34j and Pelinovsky }67j removed 
the assumption of boundedness of J" 1 and proved ^ for the case covering the 
KdV-type problems with J = d x under the assumption dim(KerL) = 1 (see 
[57] for historical discussion of the stability results) . Chugunova and Pelinovsky 
[15] studied the generalized eigenvalue problem Lu — XKu using the theory of 
indefinite inner product spaces and particularly Pontryagin Invariant Subspace 
Theorem and proved counts (inertia laws) analogous to Furthermore they 
showed how ([!]) can be treated within that context and provided an alternative 
proof of 

In 1988 Jones .''>( ) and Grillakis [22 independently proved the index theorem 
bounding the number of unstable real eigenvalues of ([I]) from below for the 
systems with the canonical form Q. 

Theorem 2 ( |301 122] ). Let J and L have the canonical structure with L± 
self-adjoint on a Hilbert space X , Ker(L + ) _L Ker(L_), and let V denote the or- 
thogonal complement o/Ker(L + )©Ker(L_) in X with the orthogonal projection 
P :Y -+V. Then 

n uns ( JL) > k r > \n(PL + P) - n(PL_P)\ . (9) 



The proofs in [22] and in [30] are significantly different, with the method 
of Grillakis [22] related to the graphical Krein signature. Note that Theorem [2] 
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does not rely on completeness of the eigenvectors of JL. Theorem[2]is frequently 
used to establish instability of various nonlinear waves, particularly in situations 
when negative spectra and kernel of L± are explicitly known. 

Kapitula and Promislow [33J reproved Theorem [TJ using the theory of [S5] for 
constrained Hamiltonian systems and the Krein matrix theory, and reformulated 
Q, (|6| by inverting the operator L + reducing |TJ) to a generalized eigenvalue 
problem for which they established (pM) . They also proved a local count theorem 
analogous to Theorem [4] of Section ^J Note that Theorem [2] can be also easily 
obtained as a corollary of a general result of [42] (see Section [4]) by using the 
same reformulation as in [33J. Both counts ^ and ^ were in a more general 
context also derived by Cuccagna et al. in [IB] in the setup allowing eigenvalues 
embedded in the essential spectrum under some further technical assumptions. 
A lower bound for the number of real eigenvalues for Hcrmitian matrix pencils 
was derived by Lancaster and Tismenetsky [52 together with various other index 
theorems for perturbed Hermitian matrix pencils (including the upper bound for 
n uns ). Also, see Grillakis 23J for the analysis of the case n(PL + P) = n(PL-P). 

Quadratic eigenvalue pencils and properties of their spectra are a well- 
studied subject with a large number of applications (see [TBI [73], and references 
therein). Particular areas where index theorems naturally appear are Sturm- 
Liouvillc problems [5] and gyroscopic stabilization. The general framework for 
a study of gyroscopic stabilization and quadratic operator pencils in general is 
to study spectrum of the pencil £(A) = X 2 A + X(D + iG) + K + iN where the 
coefficients A, D, G, K 7 N are self-adjoint operators (see [37J E3J I3BJ and refer- 
ences therein) under various additional conditions for the coefficients. A survey 
of all important results in this area exceeds the scope of this paper and thus 
we list here only a few of references. Fundamental results for quadratic opera- 
tor pencils were obtained by Krein and Langer [321 HO] and later extended by 
Adamyan and Pivovarchik [TJ who also proved an index theorem similar to ([8]). 
Results that can be expressed in a form of an index theorem were obtained also 
by Lancaster and his coworkers [UJ [53] , by Wimmer [77] , and Chern [13] (see 
also reference therein). Important index theorems for systems with dissipation 
D > and partial dissipation D > were proved in [751 EBJ S3] • 

In the field of stability of nonlinear waves the index theorems for quadratic 
eigenvalue pencils are a fairly new subject. Chugunova and Pelinovsky |14j 
proved the count analogous to ([8| for the quadratic Hermitian matrix pencils 
of the form A 2 I + XL + M, where M has either zero or one dimensional kernel 
(under a further non-degeneracy condition) via an application of the Pontryagin 
Invariant Subspace Theorem. Their results were late reproved and extended 
by Kollar [UJ and Kollar and Miller [31] (see Example [TJ of this manuscript). 
Recently, Bronski, Johnson and Kapitula proved the count of a similar form 
to ([8]| for the quadratic operator pencils C(X) = A + XB + X 2 C, where A and 
C are self-adjoint and B is invertible skew-symmetric extending results of [53], 
[5J, and [SB]. 

Specific counts of eigenvalues for a particular class of Sturm-Liouville oper- 
ators given by JL with J = sign(a;) and L = —d 2 /dx 2 + V(x) in L 2 (M) were 
obtained in |35[ [2] . Various types of index and eigenvalue localization theorems 
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for definite and indefinite Sturm-Liouville problems, and particularly those that 
correspond to defective symmetric operators, can be found in ^ [5] and in mul- 
tiple reference therein, see also Binding and Volkmer [6] where non-real spectra 
of JL is studied with the use of graphical Krein signature. Further bounds par- 
ticularly related to graphical Krein signature and eigenvalue branches //(A) (see 
Section [4]) were derived in [5]. The local count referred to as the Krein oscil- 
lation theorem was proved within the context of index theorems by Kapitula 
[3"T] using the Krein matrix theory. An infinitesimal version of the (local index) 
Theorem [4] for matrices is proved in [19l Theorem 12.6]. 

The theorem guaranteeing existence of a sequence of eigenvalues converging 
to zero for a general class of operator pencils with compact self-adjoint non- 
negative coefficients was proved in |41) by a simple homotopy argument as a 
generalization of the results of [26] (see also references therein). A homotopy 
argument was also used by Maddocks and Overton [59] to prove the index 
theorem for dissipatively perturbed Hamiltonian system. Index theorems within 
the context of isoperimetric calculus of variations were proved in |21] . Bronski 
and Johnson [5] derived an index theorem for the Faddeev-Takhtajan problem 
by an approach analogous to work of Klaus and Shaw HO] on the Zakharov- 
Shabat system. Also, Kozlov and Karapetyan [47] established the index theorem 
for finite dimensional Hamiltonian systems that bounds the stable index of the 
system from below and connected the result to gyroscopic stabilization. 

To enclose the historical review of results on index theorems let us point 
out that an unusually large part of the work mentioned within this section can 
be traced back to the University of Maryland at College Park, where many of 
the papers were written and many of the ideas were born. J. H. Maddocks, 
I. Gohberg, L. Greenberg, C. K. R. T. Jones, M. Grillakis, R. L. Pego, and 
one of the authors of this manuscript (R. K.) were among the others who were 
involved in the development of the theory. 

4 Graphical Interpretation of Index Theorems 

The purpose of this section is to derive index theory that encompasses Theo- 
rems [l] and [2] and demonstrates their graphical nature. While Theorem [3] was 
derived in [42 our main results contained in Theorems [4] and [5] generalize the 
theory developed in [42] . The analysis is for simplicity performed for matrix 
pencils although the results under specific assumptions can be generalized to 
infinitely dimensional setting [5 . 

Definition 3. Let C = C(X) be a Hermitian matrix pencil real analytic in A with 
a real characteristic value \q. Let Z\_ = Z^_(C) (respectively = Z\ + (£)) 

denote the number (counting multiplicity) of eigenvalue curves /j = /i(A) of 
L with /i(Ao) = and /i(A) < for all A < Ao (respectively for A > \q) 
sufficiently close to A — A . Similarly, let Z^^ = Z^^C) and Z^^ = Z^^C), 
respectively, denote the number (counting multiplicity) of eigenvalue curves [i — 
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jtt(A) of C with /Lt(A) < for all A < (and A > respectively) with sufficiently 
large |A|. 

Although it is traditional to consider polynomial Hcrmitian matrix pencils 
the theory generalizes in a straightforward manner to real analytic Hermitian 
matrix pencils £(A), i.e., real analytic functions C : K — » C" xn for which £(A) 
is a Hermitian matrix for each A £ K. 

Theorem 3. Let C{X) be a real analytic Hermitian matrix pencil. Then the 
following counts hold 

(zi oo +4 oo )-2»(r(0))-(^.+4_) = J2 k ( a )- E k ( a )' ( 10 ) 



A<0 
AG<t(£) 



A>0 
AScr(£) 



7^ y-l 



^ + -^-) = E k ( a ) 



(ii) 



A^O 
AG<t(£) 



Proof. Consider the (parameter dependent) eigenvalue problem ([3]). According 
to the perturbation theory |36| the eigenvalue and eigenvector curves /i(A) and 
u(X) are analytic in A. Furthermore, let Q± be the quadrants in the (A, ^i)-plane 
(see Fig. [2]) . A simple count of curves entering and leaving Q± yields the counts 



0. 



Zi 00 (£) - n(C(0)) - Z^(C) - K ( A ) 

A<0,A6ct(£) 

zi co (£)-n(£(0))-Z^(£)+ £ «(A) = 

A>0,AGct(£) 



Then the sum and the difference of (12 1 and (131 give (10 1 and (11) 



(12) 
(13) 

□ 



A local version of the index theorem can be proved analogously. 

Theorem 4. Let C{X) be a real analytic Hermitian matrix pencil. Then the 
following local index theorem holds for any real Ai, A2 with X± < X2: 



n{C{X 1 ))-n(C{X 2 ))+z[ + -Z\_ = £ «(A) 



(14) 



Ai<A<A 2 
AG<t(£) 



It is easy to see that 

Z{- - Z[ + = Z{ + - z\_ = «(Ao) , (15) 

A A A Q A 

since the eigenvalue branches vanishing at Aq at even order do not contribute 



to the right hand side of (15). Then 



Z\_ + ZL = k(X ) + 2Z 



A+ ' 



z 



Z\_ = Z\ + + Z\ + = z(£(A )) . (16) 
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+ jti = ff p (jC(A)) 




4- W 



n(£(0)) 





Figure 2: Schematic plot of the proof of Theorem [3] Spectrum a p (C(X) orga- 
nized in eigenvalue branches /i = /i(A) vs. A. 

Moreover, if the matrix pencil C(X) has an extra structure then the terms 
Z^^C) can be determined by the perturbation theory. Let 



£(A)=L(A)- 5 (A)I, L(X) = j2\ k L k , <?(A) = ^A fe 9fe . 



(17) 



fc=0 



fc=0 



Here Lq, . . . , L„ are complex Hermitian matrices n x n and g$, . . . g q are real 
constants. There is a freedom of choice in an inclusion of identity multipliers in 
g(X) and L(A) but the index theorems only depend on the leading order term 
of £(A) and only differences of g(X) and L(X) are relevant. Therefore we ignore 
such an ambiguity in (17) and assume X p L p ^ X q g q . Since a(L(X)) « X p a(L p ) 
and g(X) « X q g q for |A| — > oo the values of terms ^4oo m ( 10 ) and |llj) depend 



on the leading order coefficients of g(X) and L(X). 

Theorem 5. Let C(X) be a real analytic Hermitian matrix pencil satisfying \1 
and let L p is invertible. Then 



p,q 


9q 




z l +00 (c) 


q > p, q even 


9g>0 


n 


n 


q > p, q even 


g q < o 








q > p, q odd 


9q > 





n 


q > p, q odd 


g q < o 


n 





q<p 




n((-l)PL p ) 


n(L p ) 


q=p 




n{(-iy(L p -q p I)) 


n(L p - q p l) 



Setting g(X) = in Theorems [3] and [5] one can recover generalizations of 
Theorems [l] and [2] in a finite dimensional case |42| . Next we illustrate how 
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specific counts for quadratic eigenvalue pencils can be derived from Theorems [3] 
and[H 

< Example 1. Consider the quadratic Hermitian matrix pencil C(X), C : R — > 



£(A) = M + XK + A 2 I , 



M* = M, K* = K . 



(18) 



Then, L(X) = M + XK, p = 1, and g{X) = -A 2 , q = 2, and .g 9 
Therefore = = and Theorem [i] implies 



-1 in (17). 



2n(M) 



Y sign(A) K (A), Z^ + -Zl_= Y «(*)• 

AGct(£) 



A/0 
AGcr(£) 



The symmetry (A, A) of spectrum of £ implies 

2n = z(C) + n r (C) + 2n c (C) + n t (C) , 



(19) 



where n r (C), rii{C) and n r (C) are, respectively, the numbers of real, purely 
imaginary, and non-real non-purely-imaginary characteristic values of £(A), 
rii(C) is even. Also 

n r (C) = K(A) + ^(A)]+ Y [k + (A) + k"(A)] 

AScr(£),A>0 Ae CT (£),A<0 

= ]T sign(AMA) + 2 Y «"(A) + 2 ]T «+(A).(20) 

Ae<r(£) Aeer(£),A>0 Aeo-(£),A<0 

Finally, we denote 



n-(C):= 2 ^ k-(A), n+(£) := 2 £ k+(A) 

AGct(£),A>0 AGo-(£),A>0 



(21) 



motivated by the case of simple real characteristic values of C where n~ , respec- 
tively rt+, counts the number of characteristic values of C with the negative, 
respectively positive, Krein signature. Then ( 19 1 can be rewritten as 

2n(M) + (Z^ +Z^_) = n r (C) - n"(£) - n+(£) . 



(22) 



The symmetric case. If the eigenvalue problem C(X)u = fiu has an additional 
symmetry (A, /i) — > (—A, fi) the index theorem can be further simplified. Clearly, 
Zp + — Zq_, k~(X) — k + (X), i.e., n~(£) = and n+(£) + n~(£) = n r (C). 

Furthermore, all the numbers z(C),n r (£),n c (C) are even. A difference of ( [l9| 
and (pi yields 



z(£) 



n(M) + Zl] = n c (C) + ^ + n r T(£) 



The equation complementary to ( 23 1 with respect to ([19]) is 
z(C) 



n(M) + 



(23) 



(24) 
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In the special case of a real Hermitian M and purely imaginary Hermitian 
L, and under the assumption KerM C KerL the counts ( 23 )— ( 24 1 correspond 
to index theorems derived in [14] and 0T1. > 



4.1 Algebraic Calculation of and 



The counts ( 10 ),( 11 1, and ( 14 1 contain terms Z*, that have a simple graphical 
interpretation. However, it is more traditional to express them in an algebraic 
form that we derive in this section. Theorem [6] generalizes the relation between 
and n(D) in Theorem [l] (see [H]) that holds in the case of all Jordan blocks 
of the eigenvalue of JL of length two to the case of arbitrary structure of the 
kernel of £(Ao). 

Let C = C(X) be a real analytic self-adjoint operator pencil and let Ao 
be its characteristic value of a finite multiplicity. Denote Uq = Ker(£(Ao)), 
dim Wo = k. Then there exist k eigenvalue curves fJ-j{\) and associated eigen- 
vector branches Uj(X), 1 < j < k, of £(A) such that /i(Ao) = 0. Let us define 
for to > 



K+ := { Ml (A) 



K rn : = {/^(A) 

K° m := { Ml (A) 



1 < i < fc,A*r(Ao) = 0, < s < to - 1, M (m) (^o) > 0}, 



1 < » < fc,^ s) (A ) - 0, < s < to - 1, /i (m) (^o) < 0}, 
1 < i < k,fi\ s \\ ) = 0, < s < to}, (25) 

Then the sets and are disjoint for any given to, to > 0, and 

K° m = K m+1 U K+ +1 UK° m+1 , to > . (26) 

For a characteristic value Ao of £(A) of a finite multiplicity K® n = for m large 
enough. Then 



oo oo oo 



2ml 



z a+ = I U K ™\ = E ra - 4- = I U K L-i^ m \ = J2 \kL-i\+\K; 

m—1 m—1 m=l m—1 

Also, observe that 

oo 

n(£(A )) = |X "| , |gKer(A )| = m (|X+| + |X m |) . (27) 

m—1 

The quantities |-K"^| can be calculated as the numbers of positive (negative) 
eigenvalues of specific matrices defined in Theorem [6j Particularly for to odd, 
\K^\ count the number of eigenvector chains of Ao with positive (negative) 
Krein index. In that case the Krein index n(U, Ao) can be calculated by two 
different ways, either from the (algebraic) definition or by using the graphical 
Krein signature. 

< Example 2. Let £(A) = M + \L + A 2 be a quadratic Hermitian matrix pencil 
and let U = {vJ® , , } be a chain of root vectors of a characteristic value 
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Ao = of C(X). According to the definition of the Krein indices analogous to 
Definition [l] (see [32] for details) the quantities K^(J4, Ao) of U count the number 
of positive and negative eigenvalues of the Gramm matrix W 

Wij = {u^- l \Lu^) + (u^- 2 \u^) + ( U [ i - 1 U b '- 2] ) , 

for i,j = 1,2,3, where we formally set = 0. The characteristic polynomial 
/(A) = det(W — AI) is a cubic polynomial with negative leading order coefficient 
and with three real roots, either two positive and one negative or one positive 
and two negative. Thus 

k(U, 0) = - sign /(0) = - sign(det W) . (28) 



> 



<J Example 3. Consider the quadratic pencil in Example[2]and its characteristic 
value Ao = and the chain of root vectors U — {u^ , vP^, u^}. According to 
[42] Theorem 3.4] there exist eigenvalue and eigenvector branches /z(A), u(X) 
of ^ such that /z(0) = //(0) = /i"(0) = and u(Q) = vPl Also k(W,0) = 
sign//"(0) ^ 0. For notational ease we denote C = C(0), u = u(0), \i = /i(0), 
and analogously the derivatives (// — fj/(Q), u' — u'(0), etc.). We normalize 
(it(0),u(0)) = 1, differentiate ^ three times, and multiply it by u to obtain 

£"' \ C" u" ( u"'\ a'" 



Since C is Hermitian the last term on the left hand side of ( 29 ) vanishes. Also, 
differentiation of ^ implies Cu' + Cu = and Cu" + 2£V + C"u = 0. The 
operator C = C(0) is not invertible but C + II, where II is the orthogonal 
projection X Ker£(0), is. We denote C^ 1 = — (C + II) -1 and note that 
C^U = -n. Then 



- C" li" 

u' = C^C'u + Uu' , and ^ = C~ x L'u' + CT 1 —u + II— . (30) 

Simple algebra (see Theorem [6] for the derivation in the general case) reduces 
([29]) to 

k(U, Ao) = sign //"(0) = sign [(u, A 3 u) - (IT/, LILu')] , (31) 



where A3 is defined in ( 34 1 . In the case of the quadratic pencil 



a 3 = (M + iTr 1 ^ + l(m + irr 1 + l(m + n) _1 z,(M + n)- 1 ^ . 



> 



It contains only 
it does not require knowledge of the full Jordan 



Note. The formula (311 has important consequences. 
u = vP' and vf — u'(0), i.e. 

chain U, contrary to ( p8| . Also the term (IIm', LUu 1 ) is, generally, non- vanishing 
and since IIm' G Ker£, its value is not directly encoded in a (generic) chain IA 
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as the generalized eigenvectors are determined uniquely only up to a multiple 
of«[°l. It means that u'(0) is not just an arbitrary generalized root vector to 
but it captures an extra information Hu' that is not, in general, contained 
in uW. Thus the chain of root vectors (w(0), u'(Q), u"(0)/2) is exceptional that 
will be also confirmed in a general case of an arbitrary multiplicity. A similar 
calculation in the case of a quadratic matrix pencil can be found in [5] . 

As it was illustrated in the Example [3] the graphical approach requires a 
proper definition of the inverse of the operator £(Ao). If £(Ao) is Fredholm 
and self-adjoint Ker£(Ao) -L Ran£(Ao). The operator £(Ao) acts on the vec- 
tor Hilbert space X — Ker£(A ) © Ran£(A ). Let (vi,v 2 ) € Kcr£(A ) 
Ran£(Ao) then £(X )(vi,V2) — (0, £(Ao)«2)- The operator £(Ao) is 1-to-l on 
Ran£(Ao) and thus the operator £(Ao) +11 is invertible as (£(Ao) +Il)(wi, w 2 ) = 
(vi,£(\ )v 2 ). 

Definition 4. Let £ = £(A) be an operator pencil acting on a Hilbert space 
X with a characteristic value Xq of a finite multiplicity, and let £(Ao) has a 
Fredholm index zero. Let II be an orthogonal projection X — > Ker£(Ao). Then 
we define 

C- 1 : =-(£(Ao)+n)" 1 . (32) 

Clearly £11 = and Cr x n = -II. Also, denote D := d/dX. 

Theorem 6. Let C = C(X) be an operator pencil with the characteristic value 
Ao of a finite multiplicity, and C(Xq) of a Fredholm index zero such that the 
eigenvalues and eigenvectors of C(X) in an open real neighborhood of Xq is con- 
tained in analytic branches Hj(X), Uj(X), j = l,...,fc. Let K^, m > 0, be 
defined as above and let U$ be a matrix with vectors in Ker£(Ao) as its columns 
and define recurrently 

U m+1 := U m Ker {U* m H m+1 U m ) , (33) 
where the operator FL m , m > 1 is defined as H m := A m + T> m . Here 

A m := E C^(Xo)^^o)^ £ - 1 C^M i (34) 

\a\—m 

V m := Y, D ai UA a2 UD a \ (35) 

\a\—m 

where the multi-index a = (oil, ■ ■ ■ , a s ) has positive integer entries and its norm 
is calculated as \a\ = X)i=i a i- Then for m > 1 

\ K m\ = P ( U *n-1 H m U m -i) , \K~\ = n (U^ 1 H m U m - X ) , U^H m+1 U m+1 = . 



Proof. We prove Theorem [6] by mathematical induction for m > 1. Without 
the loss of generality assume Aq = 0. 
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First, let m — 1. Let fi t e K$. Then 

(£(A)- M i(A)K(A)=0. 



(36) 



Differentiation of ( 36 1 with respect to A at A = Ao where jtij (0) = and a scalar 
product with Uj, Uj £ Kq yields 

(u j ,(£'-»' i )u i ) + (u j ,£u' i )=0. (37) 

where for a notational ease we drop the argument of £, (i and u and their 
derivatives. The second term in (37) vanishes as {uj,Cu[) = (jCuj,^) = 0. 
Therefore 



(uj,C'ui) = n'i (uj,Ui) = fi'^ , 



(38) 



with Sij = 1 for i = j and 5ij = for i ^ j. Consequently, the matrix (uj,jC'ui), 
Ui, uj € Ker £(0) for j = 1, . . . , fc, is diagonal with its eigenvalues on a diagonal. 
The number of its positive, negative and zero eigenvalues is independent of a 
choice of basis of Ker£(0) and it is the signature of the quadratic form (-,£'•) 
on Ker£(0). Since Ai = £'(0) and T>\ = we derived 

\K+ 1 = P (U* H x U ) , \K~\=n (U* H x U ) . 

Let /i G Ker(t/ * ffj 17 ). Then C/ * iJi C/ /i = and if v e U Kcr(C/ * H x U ) = U x 
also C/ * iJi J7i = 0. 

Now assume that the statement of the theorem holds for all j, j < to. Let 
fii G Then differentiation of (36) with respect to A m-times at A = 

where /Xj(0) = • •• = ^| m_1 ^(0) = gives 



E 



£(m-j) t?) 



(m) 



(m-j)! j! 



Also, 



E fg 

3=0 y 



i)! i! 







(39) 



(40) 



for s = 1, . . . , m— 1. Adding the term IIi4 /s! to both sides of (40 1 and inverting 
the operator (£ + II) yields an expression for u[ s \ Now we rewrite (39) as 

,U) „(m) 



£(m) 

(m)! 



Mi 



™ £(m-j) 

^ (m- j')! j! 



(41) 



and recursively express each term uf' /s\, 1 < s < to, that does not contain 
the projection operator IT until all the terms in the sum on the right hand 
side contain either u$ or a projection operator LI. Since the total number of 
derivatives of £ and it; at A = in each term is equal to, and all possible 
decompositions of m in to a sum are used, equation (41) reduces to 

„,( s ) „,( m ) 



m— 1 

E 

s=l 



A, 



,n- 



c- 



(42) 
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Multiplying p2| by uj € yields 



,{m) 



(43) 



Since Uj € it can be also expressed for any p < m as (compare to (42 1 ) 



ApUj 



p-1 (s) (p) 

Y]A p _ s n^ r + £^ r = o. 



(44) 



Then each individual term in the summand in the second term on the left-hand 
side can be reduced 

u u 



m—s — 1 



u [ F ] u [s) m - s - 1 



ELL- II,, Uj „• 7 /, . w * 

r! s! p! s! z — ' 



4 r) M (s) . 

r n^,n-" 



r 



Therefore using the fact (uj , ■C M -j-) = equation (43) can be rewritten as 



(uj,H m Ui) = (u v A m Ui) + (uj,V m Ui) 



(to) 



(45) 



Therefore the matrix U m *H m U m is diagonal and 

=p(U 7 * n H m U m ) , \K m+1 \ = n {U* m H m U m ) , \K° m+l \ = * {U* m H m U m ) . 



Multiplication of (45 1 by h e Ker([/^7f m t/ m ) gives V^E^^Jn = that implies 
U* n H rn U m+ i = 0. □ 



Note. According to (35) we have T>i — T> 2 — 0. Also, on the left hand side 
of (24) are two terms z(£) and that are connected with the properties of 
(generalized) kernel of C. It is easy to see that under an assumption Ker(M) C 
Ker(L) one has z(C) — 2 dim(Ker(M)) that leads to the simplified expression 
in <|8b (see 41 for the details). 



5 Conclusions 

We presented a unifying view of the index theorems frequently used across vari- 
ous fields. Furthermore, we demonstrated a special property of the chain of root 
vectors generated by the graphical method that allowed us to derive formulae for 
the number of eigenvalue curves of the eigenvalue problem C(X)u = \iu entering 
the lower half-plane of the plane (fi, A) through the characteristic value Ao of 
C(X) that did not require knowledge of the full chain of the root vectors. Both 
these results demonstrate the extraordinary beauty and power of the graphi- 
cal approach. Let us conclude by a quote from [B]: "Eigencurves (produced 
by the graphical approach) seem to provide a very useful tool in a variety of 
circumstances, and their theory and applications are quite underdeveloped", a 
statement that certainly remains true even today. 
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